Overview

Dataset statistics

Number of variables23
Number of observations1539
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory184.5 KiB
Average record size in memory122.7 B

Variable types

Numeric13
Categorical10

Warnings

iclevel has constant value "Four or more years" Constant
instnm has a high cardinality: 1526 distinct values High cardinality
stabbr has a high cardinality: 54 distinct values High cardinality
admssn is highly correlated with enrlft and 1 other fieldsHigh correlation
enrlft is highly correlated with admssn and 1 other fieldsHigh correlation
enrlt is highly correlated with admssn and 1 other fieldsHigh correlation
instcat is highly correlated with iclevelHigh correlation
control is highly correlated with sector and 1 other fieldsHigh correlation
instsize is highly correlated with iclevelHigh correlation
sector is highly correlated with control and 1 other fieldsHigh correlation
alloncam is highly correlated with iclevelHigh correlation
locale is highly correlated with iclevelHigh correlation
iclevel is highly correlated with instcat and 7 other fieldsHigh correlation
stabbr is highly correlated with iclevelHigh correlation
c15basic is highly correlated with iclevelHigh correlation
instnm is uniformly distributed Uniform
unitid has unique values Unique
latitude has unique values Unique
applfeeu has 445 (28.9%) zeros Zeros

Reproduction

Analysis started2021-02-27 21:51:11.902692
Analysis finished2021-02-27 21:51:31.314916
Duration19.41 seconds
Software versionpandas-profiling v2.11.0
Download configurationconfig.yaml

Variables

unitid
Real number (ℝ≥0)

UNIQUE

Distinct1539
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean193321.6407
Minimum100654
Maximum491057
Zeros0
Zeros (%)0.0%
Memory size12.1 KiB

Quantile statistics

Minimum100654
5-th percentile110713.1
Q1154292.5
median188030
Q3215673
95-th percentile243465.2
Maximum491057
Range390403
Interquartile range (IQR)61380.5

Descriptive statistics

Standard deviation67928.81782
Coefficient of variation (CV)0.3513772053
Kurtosis8.078420319
Mean193321.6407
Median Absolute Deviation (MAD)30273
Skewness2.481751279
Sum297522005
Variance4614324290
MonotocityStrictly increasing
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1699831
 
0.1%
1665131
 
0.1%
1235541
 
0.1%
1481311
 
0.1%
1399401
 
0.1%
1276531
 
0.1%
2334261
 
0.1%
1890971
 
0.1%
1952431
 
0.1%
4541841
 
0.1%
Other values (1529)1529
99.4%
ValueCountFrequency (%)
1006541
0.1%
1006631
0.1%
1007061
0.1%
1007241
0.1%
1007511
0.1%
ValueCountFrequency (%)
4910571
0.1%
4908051
0.1%
4905131
0.1%
4905041
0.1%
4903191
0.1%

instnm
Categorical

HIGH CARDINALITY
UNIFORM

Distinct1526
Distinct (%)99.2%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
Westminster College
 
3
Union College
 
3
Marian University
 
2
Bethany College
 
2
Emmanuel College
 
2
Other values (1521)
1527 

Length

Max length75
Median length24
Mean length25.25276153
Min length6

Characters and Unicode

Total characters38864
Distinct characters60
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1515 ?
Unique (%)98.4%

Sample

1st rowAlabama A & M University
2nd rowUniversity of Alabama at Birmingham
3rd rowUniversity of Alabama in Huntsville
4th rowAlabama State University
5th rowThe University of Alabama
ValueCountFrequency (%)
Westminster College3
 
0.2%
Union College3
 
0.2%
Marian University2
 
0.1%
Bethany College2
 
0.1%
Emmanuel College2
 
0.1%
Sterling College2
 
0.1%
Anderson University2
 
0.1%
University of St Thomas2
 
0.1%
Bethel University2
 
0.1%
St. John's College2
 
0.1%
Other values (1516)1517
98.6%
Histogram of lengths of the category
ValueCountFrequency (%)
university846
 
17.1%
college477
 
9.7%
of373
 
7.5%
state193
 
3.9%
the64
 
1.3%
at50
 
1.0%
saint39
 
0.8%
institute38
 
0.8%
and36
 
0.7%
new32
 
0.6%
Other values (1382)2794
56.5%

Most occurring characters

ValueCountFrequency (%)
e3934
 
10.1%
i3496
 
9.0%
3404
 
8.8%
n2706
 
7.0%
t2643
 
6.8%
a2255
 
5.8%
r2244
 
5.8%
o2210
 
5.7%
s2044
 
5.3%
l1987
 
5.1%
Other values (50)11941
30.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter30457
78.4%
Uppercase Letter4730
 
12.2%
Space Separator3404
 
8.8%
Dash Punctuation211
 
0.5%
Other Punctuation59
 
0.2%
Open Punctuation1
 
< 0.1%
Math Symbol1
 
< 0.1%
Close Punctuation1
 
< 0.1%

Most frequent character per category

ValueCountFrequency (%)
U997
21.1%
C835
17.7%
S516
10.9%
M329
 
7.0%
A210
 
4.4%
T177
 
3.7%
B164
 
3.5%
N164
 
3.5%
W156
 
3.3%
P155
 
3.3%
Other values (16)1027
21.7%
ValueCountFrequency (%)
e3934
12.9%
i3496
11.5%
n2706
8.9%
t2643
8.7%
a2255
 
7.4%
r2244
 
7.4%
o2210
 
7.3%
s2044
 
6.7%
l1987
 
6.5%
y1293
 
4.2%
Other values (16)5645
18.5%
ValueCountFrequency (%)
&26
44.1%
'26
44.1%
.7
 
11.9%
ValueCountFrequency (%)
3404
100.0%
ValueCountFrequency (%)
-211
100.0%
ValueCountFrequency (%)
(1
100.0%
ValueCountFrequency (%)
+1
100.0%
ValueCountFrequency (%)
)1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin35187
90.5%
Common3677
 
9.5%

Most frequent character per script

ValueCountFrequency (%)
e3934
 
11.2%
i3496
 
9.9%
n2706
 
7.7%
t2643
 
7.5%
a2255
 
6.4%
r2244
 
6.4%
o2210
 
6.3%
s2044
 
5.8%
l1987
 
5.6%
y1293
 
3.7%
Other values (42)10375
29.5%
ValueCountFrequency (%)
3404
92.6%
-211
 
5.7%
&26
 
0.7%
'26
 
0.7%
.7
 
0.2%
(1
 
< 0.1%
+1
 
< 0.1%
)1
 
< 0.1%

Most occurring blocks

ValueCountFrequency (%)
ASCII38864
100.0%

Most frequent character per block

ValueCountFrequency (%)
e3934
 
10.1%
i3496
 
9.0%
3404
 
8.8%
n2706
 
7.0%
t2643
 
6.8%
a2255
 
5.8%
r2244
 
5.8%
o2210
 
5.7%
s2044
 
5.3%
l1987
 
5.1%
Other values (50)11941
30.7%

stabbr
Categorical

HIGH CARDINALITY
HIGH CORRELATION

Distinct54
Distinct (%)3.5%
Missing0
Missing (%)0.0%
Memory size12.1 KiB
NY
145 
PA
112 
CA
 
93
MA
 
68
TX
 
67
Other values (49)
1054 

Length

Max length2
Median length2
Mean length2
Min length2

Characters and Unicode

Total characters3078
Distinct characters24
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique3 ?
Unique (%)0.2%

Sample

1st rowAL
2nd rowAL
3rd rowAL
4th rowAL
5th rowAL
ValueCountFrequency (%)
NY145
 
9.4%
PA112
 
7.3%
CA93
 
6.0%
MA68
 
4.4%
TX67
 
4.4%
OH62
 
4.0%
IL57
 
3.7%
NC55
 
3.6%
GA46
 
3.0%
FL44
 
2.9%
Other values (44)790
51.3%
Histogram of lengths of the category
ValueCountFrequency (%)
ny145
 
9.4%
pa112
 
7.3%
ca93
 
6.0%
ma68
 
4.4%
tx67
 
4.4%
oh62
 
4.0%
il57
 
3.7%
nc55
 
3.6%
ga46
 
3.0%
fl44
 
2.9%
Other values (44)790
51.3%

Most occurring characters

ValueCountFrequency (%)
A481
15.6%
N398
12.9%
M259
 
8.4%
I223
 
7.2%
C219
 
7.1%
Y173
 
5.6%
O162
 
5.3%
T155
 
5.0%
L144
 
4.7%
P118
 
3.8%
Other values (14)746
24.2%

Most occurring categories

ValueCountFrequency (%)
Uppercase Letter3078
100.0%

Most frequent character per category

ValueCountFrequency (%)
A481
15.6%
N398
12.9%
M259
 
8.4%
I223
 
7.2%
C219
 
7.1%
Y173
 
5.6%
O162
 
5.3%
T155
 
5.0%
L144
 
4.7%
P118
 
3.8%
Other values (14)746
24.2%

Most occurring scripts

ValueCountFrequency (%)
Latin3078
100.0%

Most frequent character per script

ValueCountFrequency (%)
A481
15.6%
N398
12.9%
M259
 
8.4%
I223
 
7.2%
C219
 
7.1%
Y173
 
5.6%
O162
 
5.3%
T155
 
5.0%
L144
 
4.7%
P118
 
3.8%
Other values (14)746
24.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII3078
100.0%

Most frequent character per block

ValueCountFrequency (%)
A481
15.6%
N398
12.9%
M259
 
8.4%
I223
 
7.2%
C219
 
7.1%
Y173
 
5.6%
O162
 
5.3%
T155
 
5.0%
L144
 
4.7%
P118
 
3.8%
Other values (14)746
24.2%

sector
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Private not-for-profit, 4-year or above
1031 
Public, 4-year or above
508 

Length

Max length39
Median length39
Mean length33.71864847
Min length23

Characters and Unicode

Total characters51893
Distinct characters20
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPublic, 4-year or above
2nd rowPublic, 4-year or above
3rd rowPublic, 4-year or above
4th rowPublic, 4-year or above
5th rowPublic, 4-year or above
ValueCountFrequency (%)
Private not-for-profit, 4-year or above1031
67.0%
Public, 4-year or above508
33.0%
Histogram of lengths of the category
ValueCountFrequency (%)
above1539
21.4%
or1539
21.4%
4-year1539
21.4%
not-for-profit1031
14.3%
private1031
14.3%
public508
 
7.1%

Most occurring characters

ValueCountFrequency (%)
r6171
11.9%
o6171
11.9%
5648
10.9%
e4109
 
7.9%
a4109
 
7.9%
-3601
 
6.9%
t3093
 
6.0%
i2570
 
5.0%
v2570
 
5.0%
f2062
 
4.0%
Other values (10)11789
22.7%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter38027
73.3%
Space Separator5648
 
10.9%
Dash Punctuation3601
 
6.9%
Uppercase Letter1539
 
3.0%
Other Punctuation1539
 
3.0%
Decimal Number1539
 
3.0%

Most frequent character per category

ValueCountFrequency (%)
r6171
16.2%
o6171
16.2%
e4109
10.8%
a4109
10.8%
t3093
8.1%
i2570
6.8%
v2570
6.8%
f2062
 
5.4%
b2047
 
5.4%
y1539
 
4.0%
Other values (5)3586
9.4%
ValueCountFrequency (%)
P1539
100.0%
ValueCountFrequency (%)
,1539
100.0%
ValueCountFrequency (%)
5648
100.0%
ValueCountFrequency (%)
41539
100.0%
ValueCountFrequency (%)
-3601
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin39566
76.2%
Common12327
 
23.8%

Most frequent character per script

ValueCountFrequency (%)
r6171
15.6%
o6171
15.6%
e4109
10.4%
a4109
10.4%
t3093
7.8%
i2570
6.5%
v2570
6.5%
f2062
 
5.2%
b2047
 
5.2%
P1539
 
3.9%
Other values (6)5125
13.0%
ValueCountFrequency (%)
5648
45.8%
-3601
29.2%
,1539
 
12.5%
41539
 
12.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII51893
100.0%

Most frequent character per block

ValueCountFrequency (%)
r6171
11.9%
o6171
11.9%
5648
10.9%
e4109
 
7.9%
a4109
 
7.9%
-3601
 
6.9%
t3093
 
6.0%
i2570
 
5.0%
v2570
 
5.0%
f2062
 
4.0%
Other values (10)11789
22.7%

iclevel
Categorical

CONSTANT
HIGH CORRELATION
REJECTED

Distinct1
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
Four or more years
1539 

Length

Max length18
Median length18
Mean length18
Min length18

Characters and Unicode

Total characters27702
Distinct characters10
Distinct categories3 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowFour or more years
2nd rowFour or more years
3rd rowFour or more years
4th rowFour or more years
5th rowFour or more years
ValueCountFrequency (%)
Four or more years1539
100.0%
Histogram of lengths of the category
ValueCountFrequency (%)
more1539
25.0%
or1539
25.0%
four1539
25.0%
years1539
25.0%

Most occurring characters

ValueCountFrequency (%)
r6156
22.2%
o4617
16.7%
4617
16.7%
e3078
11.1%
F1539
 
5.6%
u1539
 
5.6%
m1539
 
5.6%
y1539
 
5.6%
a1539
 
5.6%
s1539
 
5.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter21546
77.8%
Space Separator4617
 
16.7%
Uppercase Letter1539
 
5.6%

Most frequent character per category

ValueCountFrequency (%)
r6156
28.6%
o4617
21.4%
e3078
14.3%
u1539
 
7.1%
m1539
 
7.1%
y1539
 
7.1%
a1539
 
7.1%
s1539
 
7.1%
ValueCountFrequency (%)
F1539
100.0%
ValueCountFrequency (%)
4617
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin23085
83.3%
Common4617
 
16.7%

Most frequent character per script

ValueCountFrequency (%)
r6156
26.7%
o4617
20.0%
e3078
13.3%
F1539
 
6.7%
u1539
 
6.7%
m1539
 
6.7%
y1539
 
6.7%
a1539
 
6.7%
s1539
 
6.7%
ValueCountFrequency (%)
4617
100.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII27702
100.0%

Most frequent character per block

ValueCountFrequency (%)
r6156
22.2%
o4617
16.7%
4617
16.7%
e3078
11.1%
F1539
 
5.6%
u1539
 
5.6%
m1539
 
5.6%
y1539
 
5.6%
a1539
 
5.6%
s1539
 
5.6%

control
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
Private not-for-profit
1031 
Public
508 

Length

Max length22
Median length22
Mean length16.71864847
Min length6

Characters and Unicode

Total characters25730
Distinct characters17
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowPublic
2nd rowPublic
3rd rowPublic
4th rowPublic
5th rowPublic
ValueCountFrequency (%)
Private not-for-profit1031
67.0%
Public508
33.0%
Histogram of lengths of the category
ValueCountFrequency (%)
private1031
40.1%
not-for-profit1031
40.1%
public508
19.8%

Most occurring characters

ValueCountFrequency (%)
r3093
12.0%
t3093
12.0%
o3093
12.0%
i2570
10.0%
-2062
8.0%
f2062
8.0%
P1539
 
6.0%
v1031
 
4.0%
a1031
 
4.0%
e1031
 
4.0%
Other values (7)5125
19.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter21098
82.0%
Dash Punctuation2062
 
8.0%
Uppercase Letter1539
 
6.0%
Space Separator1031
 
4.0%

Most frequent character per category

ValueCountFrequency (%)
r3093
14.7%
t3093
14.7%
o3093
14.7%
i2570
12.2%
f2062
9.8%
v1031
 
4.9%
a1031
 
4.9%
e1031
 
4.9%
n1031
 
4.9%
p1031
 
4.9%
Other values (4)2032
9.6%
ValueCountFrequency (%)
P1539
100.0%
ValueCountFrequency (%)
1031
100.0%
ValueCountFrequency (%)
-2062
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin22637
88.0%
Common3093
 
12.0%

Most frequent character per script

ValueCountFrequency (%)
r3093
13.7%
t3093
13.7%
o3093
13.7%
i2570
11.4%
f2062
9.1%
P1539
6.8%
v1031
 
4.6%
a1031
 
4.6%
e1031
 
4.6%
n1031
 
4.6%
Other values (5)3063
13.5%
ValueCountFrequency (%)
-2062
66.7%
1031
33.3%

Most occurring blocks

ValueCountFrequency (%)
ASCII25730
100.0%

Most frequent character per block

ValueCountFrequency (%)
r3093
12.0%
t3093
12.0%
o3093
12.0%
i2570
10.0%
-2062
8.0%
f2062
8.0%
P1539
 
6.0%
v1031
 
4.0%
a1031
 
4.0%
e1031
 
4.0%
Other values (7)5125
19.9%

locale
Categorical

HIGH CORRELATION

Distinct12
Distinct (%)0.8%
Missing0
Missing (%)0.0%
Memory size2.4 KiB
City: Large
340 
Suburb: Large
301 
City: Small
209 
Town: Distant
179 
City: Midsize
172 
Other values (7)
338 

Length

Max length15
Median length13
Mean length12.2605588
Min length11

Characters and Unicode

Total characters18869
Distinct characters27
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowCity: Midsize
2nd rowCity: Midsize
3rd rowCity: Midsize
4th rowCity: Midsize
5th rowCity: Small
ValueCountFrequency (%)
City: Large340
22.1%
Suburb: Large301
19.6%
City: Small209
13.6%
Town: Distant179
11.6%
City: Midsize172
11.2%
Town: Remote109
 
7.1%
Town: Fringe62
 
4.0%
Suburb: Midsize51
 
3.3%
Rural: Fringe40
 
2.6%
Suburb: Small31
 
2.0%
Other values (2)45
 
2.9%
Histogram of lengths of the category
ValueCountFrequency (%)
city721
23.4%
large641
20.8%
suburb383
12.4%
town350
11.4%
small240
 
7.8%
midsize223
 
7.2%
distant208
 
6.8%
remote125
 
4.1%
fringe102
 
3.3%
rural85
 
2.8%

Most occurring characters

ValueCountFrequency (%)
:1539
 
8.2%
1539
 
8.2%
i1477
 
7.8%
t1262
 
6.7%
e1216
 
6.4%
r1211
 
6.4%
a1174
 
6.2%
u851
 
4.5%
b766
 
4.1%
g743
 
3.9%
Other values (17)7091
37.6%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter12713
67.4%
Uppercase Letter3078
 
16.3%
Other Punctuation1539
 
8.2%
Space Separator1539
 
8.2%

Most frequent character per category

ValueCountFrequency (%)
i1477
11.6%
t1262
9.9%
e1216
9.6%
r1211
9.5%
a1174
9.2%
u851
 
6.7%
b766
 
6.0%
g743
 
5.8%
y721
 
5.7%
n660
 
5.2%
Other values (7)2632
20.7%
ValueCountFrequency (%)
C721
23.4%
L641
20.8%
S623
20.2%
T350
11.4%
M223
 
7.2%
R210
 
6.8%
D208
 
6.8%
F102
 
3.3%
ValueCountFrequency (%)
:1539
100.0%
ValueCountFrequency (%)
1539
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin15791
83.7%
Common3078
 
16.3%

Most frequent character per script

ValueCountFrequency (%)
i1477
 
9.4%
t1262
 
8.0%
e1216
 
7.7%
r1211
 
7.7%
a1174
 
7.4%
u851
 
5.4%
b766
 
4.9%
g743
 
4.7%
C721
 
4.6%
y721
 
4.6%
Other values (15)5649
35.8%
ValueCountFrequency (%)
:1539
50.0%
1539
50.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII18869
100.0%

Most frequent character per block

ValueCountFrequency (%)
:1539
 
8.2%
1539
 
8.2%
i1477
 
7.8%
t1262
 
6.7%
e1216
 
6.4%
r1211
 
6.4%
a1174
 
6.2%
u851
 
4.5%
b766
 
4.1%
g743
 
3.9%
Other values (17)7091
37.6%

instcat
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
Degree-granting, primarily baccalaureate or above
1502 
Degree-granting, not primarily baccalaureate or above
 
37

Length

Max length53
Median length49
Mean length49.09616634
Min length49

Characters and Unicode

Total characters75559
Distinct characters20
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st rowDegree-granting, primarily baccalaureate or above
2nd rowDegree-granting, primarily baccalaureate or above
3rd rowDegree-granting, primarily baccalaureate or above
4th rowDegree-granting, primarily baccalaureate or above
5th rowDegree-granting, primarily baccalaureate or above
ValueCountFrequency (%)
Degree-granting, primarily baccalaureate or above1502
97.6%
Degree-granting, not primarily baccalaureate or above37
 
2.4%
Histogram of lengths of the category
ValueCountFrequency (%)
degree-granting1539
19.9%
primarily1539
19.9%
above1539
19.9%
or1539
19.9%
baccalaureate1539
19.9%
not37
 
0.5%

Most occurring characters

ValueCountFrequency (%)
a10773
14.3%
e9234
12.2%
r9234
12.2%
6193
 
8.2%
g4617
 
6.1%
i4617
 
6.1%
n3115
 
4.1%
t3115
 
4.1%
o3115
 
4.1%
l3078
 
4.1%
Other values (10)18468
24.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter64749
85.7%
Space Separator6193
 
8.2%
Uppercase Letter1539
 
2.0%
Dash Punctuation1539
 
2.0%
Other Punctuation1539
 
2.0%

Most frequent character per category

ValueCountFrequency (%)
a10773
16.6%
e9234
14.3%
r9234
14.3%
g4617
7.1%
i4617
7.1%
n3115
 
4.8%
t3115
 
4.8%
o3115
 
4.8%
l3078
 
4.8%
b3078
 
4.8%
Other values (6)10773
16.6%
ValueCountFrequency (%)
D1539
100.0%
ValueCountFrequency (%)
-1539
100.0%
ValueCountFrequency (%)
,1539
100.0%
ValueCountFrequency (%)
6193
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin66288
87.7%
Common9271
 
12.3%

Most frequent character per script

ValueCountFrequency (%)
a10773
16.3%
e9234
13.9%
r9234
13.9%
g4617
 
7.0%
i4617
 
7.0%
n3115
 
4.7%
t3115
 
4.7%
o3115
 
4.7%
l3078
 
4.6%
b3078
 
4.6%
Other values (7)12312
18.6%
ValueCountFrequency (%)
6193
66.8%
-1539
 
16.6%
,1539
 
16.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII75559
100.0%

Most frequent character per block

ValueCountFrequency (%)
a10773
14.3%
e9234
12.2%
r9234
12.2%
6193
 
8.2%
g4617
 
6.1%
i4617
 
6.1%
n3115
 
4.1%
t3115
 
4.1%
o3115
 
4.1%
l3078
 
4.1%
Other values (10)18468
24.4%

c15basic
Categorical

HIGH CORRELATION

Distinct19
Distinct (%)1.2%
Missing0
Missing (%)0.0%
Memory size3.1 KiB
Master^s Colleges & Universities: Larger Programs
332 
Baccalaureate Colleges: Arts & Sciences Focus
231 
Baccalaureate Colleges: Diverse Fields
199 
Master^s Colleges & Universities: Medium Programs
166 
Doctoral Universities: Highest Research Activity
114 
Other values (14)
497 

Length

Max length79
Median length49
Mean length47.63287849
Min length15

Characters and Unicode

Total characters73307
Distinct characters48
Distinct categories8 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st rowMaster^s Colleges & Universities: Larger Programs
2nd rowDoctoral Universities: Highest Research Activity
3rd rowDoctoral Universities: Higher Research Activity
4th rowMaster^s Colleges & Universities: Medium Programs
5th rowDoctoral Universities: Higher Research Activity
ValueCountFrequency (%)
Master^s Colleges & Universities: Larger Programs332
21.6%
Baccalaureate Colleges: Arts & Sciences Focus231
15.0%
Baccalaureate Colleges: Diverse Fields199
12.9%
Master^s Colleges & Universities: Medium Programs166
10.8%
Doctoral Universities: Highest Research Activity114
 
7.4%
Doctoral Universities: Higher Research Activity100
 
6.5%
Master^s Colleges & Universities: Small Programs98
 
6.4%
Special Focus Four-Year: Faith-Related Institutions97
 
6.3%
Doctoral Universities: Moderate Research Activity84
 
5.5%
Special Focus Four-Year: Arts, Music & Design Schools41
 
2.7%
Other values (9)77
 
5.0%
Histogram of lengths of the category
ValueCountFrequency (%)
colleges1053
 
12.3%
universities894
 
10.4%
876
 
10.2%
master^s596
 
7.0%
programs596
 
7.0%
baccalaureate430
 
5.0%
focus404
 
4.7%
larger332
 
3.9%
research298
 
3.5%
activity298
 
3.5%
Other values (39)2783
32.5%

Most occurring characters

ValueCountFrequency (%)
e8895
12.1%
7021
 
9.6%
s6936
 
9.5%
r5567
 
7.6%
i5060
 
6.9%
a4924
 
6.7%
t4009
 
5.5%
l3671
 
5.0%
o3316
 
4.5%
c3101
 
4.2%
Other values (38)20807
28.4%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter54963
75.0%
Uppercase Letter7855
 
10.7%
Space Separator7021
 
9.6%
Other Punctuation2501
 
3.4%
Modifier Symbol645
 
0.9%
Dash Punctuation286
 
0.4%
Open Punctuation18
 
< 0.1%
Close Punctuation18
 
< 0.1%

Most frequent character per category

ValueCountFrequency (%)
e8895
16.2%
s6936
12.6%
r5567
10.1%
i5060
9.2%
a4924
9.0%
t4009
7.3%
l3671
6.7%
o3316
 
6.0%
c3101
 
5.6%
g2328
 
4.2%
Other values (11)7156
13.0%
ValueCountFrequency (%)
C1071
13.6%
M918
11.7%
U894
11.4%
F870
11.1%
A619
7.9%
P612
7.8%
S572
7.3%
D539
6.9%
B485
6.2%
R395
 
5.0%
Other values (8)880
11.2%
ValueCountFrequency (%)
:1519
60.7%
&876
35.0%
,59
 
2.4%
/47
 
1.9%
ValueCountFrequency (%)
^645
100.0%
ValueCountFrequency (%)
7021
100.0%
ValueCountFrequency (%)
-286
100.0%
ValueCountFrequency (%)
(18
100.0%
ValueCountFrequency (%)
)18
100.0%

Most occurring scripts

ValueCountFrequency (%)
Latin62818
85.7%
Common10489
 
14.3%

Most frequent character per script

ValueCountFrequency (%)
e8895
14.2%
s6936
11.0%
r5567
 
8.9%
i5060
 
8.1%
a4924
 
7.8%
t4009
 
6.4%
l3671
 
5.8%
o3316
 
5.3%
c3101
 
4.9%
g2328
 
3.7%
Other values (29)15011
23.9%
ValueCountFrequency (%)
7021
66.9%
:1519
 
14.5%
&876
 
8.4%
^645
 
6.1%
-286
 
2.7%
,59
 
0.6%
/47
 
0.4%
(18
 
0.2%
)18
 
0.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII73307
100.0%

Most frequent character per block

ValueCountFrequency (%)
e8895
12.1%
7021
 
9.6%
s6936
 
9.5%
r5567
 
7.6%
i5060
 
6.9%
a4924
 
6.7%
t4009
 
5.5%
l3671
 
5.0%
o3316
 
4.5%
c3101
 
4.2%
Other values (38)20807
28.4%

instsize
Categorical

HIGH CORRELATION

Distinct5
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.0 KiB
1,000 - 4,999
714 
Under 1,000
293 
5,000 - 9,999
220 
10,000 - 19,999
163 
20,000 and above
149 

Length

Max length16
Median length13
Mean length13.12150747
Min length11

Characters and Unicode

Total characters20194
Distinct characters18
Distinct categories6 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row5,000 - 9,999
2nd row20,000 and above
3rd row5,000 - 9,999
4th row1,000 - 4,999
5th row20,000 and above
ValueCountFrequency (%)
1,000 - 4,999714
46.4%
Under 1,000293
19.0%
5,000 - 9,999220
 
14.3%
10,000 - 19,999163
 
10.6%
20,000 and above149
 
9.7%
Histogram of lengths of the category
ValueCountFrequency (%)
1097
25.4%
1,0001007
23.3%
4,999714
16.5%
under293
 
6.8%
9,999220
 
5.1%
5,000220
 
5.1%
19,999163
 
3.8%
10,000163
 
3.8%
20,000149
 
3.4%
above149
 
3.4%

Most occurring characters

ValueCountFrequency (%)
04929
24.4%
93674
18.2%
2785
13.8%
,2636
13.1%
11333
 
6.6%
-1097
 
5.4%
4714
 
3.5%
n442
 
2.2%
d442
 
2.2%
e442
 
2.2%
Other values (8)1700
 
8.4%

Most occurring categories

ValueCountFrequency (%)
Decimal Number11019
54.6%
Space Separator2785
 
13.8%
Other Punctuation2636
 
13.1%
Lowercase Letter2364
 
11.7%
Dash Punctuation1097
 
5.4%
Uppercase Letter293
 
1.5%

Most frequent character per category

ValueCountFrequency (%)
n442
18.7%
d442
18.7%
e442
18.7%
a298
12.6%
r293
12.4%
b149
 
6.3%
o149
 
6.3%
v149
 
6.3%
ValueCountFrequency (%)
04929
44.7%
93674
33.3%
11333
 
12.1%
4714
 
6.5%
5220
 
2.0%
2149
 
1.4%
ValueCountFrequency (%)
,2636
100.0%
ValueCountFrequency (%)
2785
100.0%
ValueCountFrequency (%)
-1097
100.0%
ValueCountFrequency (%)
U293
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common17537
86.8%
Latin2657
 
13.2%

Most frequent character per script

ValueCountFrequency (%)
04929
28.1%
93674
20.9%
2785
15.9%
,2636
15.0%
11333
 
7.6%
-1097
 
6.3%
4714
 
4.1%
5220
 
1.3%
2149
 
0.8%
ValueCountFrequency (%)
n442
16.6%
d442
16.6%
e442
16.6%
a298
11.2%
U293
11.0%
r293
11.0%
b149
 
5.6%
o149
 
5.6%
v149
 
5.6%

Most occurring blocks

ValueCountFrequency (%)
ASCII20194
100.0%

Most frequent character per block

ValueCountFrequency (%)
04929
24.4%
93674
18.2%
2785
13.8%
,2636
13.1%
11333
 
6.6%
-1097
 
5.4%
4714
 
3.5%
n442
 
2.2%
d442
 
2.2%
e442
 
2.2%
Other values (8)1700
 
8.4%

longitud
Real number (ℝ)

Distinct1538
Distinct (%)99.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-87.55845642
Minimum-157.92659
Maximum144.8358154
Zeros0
Zeros (%)0.0%
Memory size6.1 KiB

Quantile statistics

Minimum-157.92659
5-th percentile-120.4362846
Q1-93.73413086
median-84.06147766
Q3-76.49274445
95-th percentile-71.45867767
Maximum144.8358154
Range302.7624054
Interquartile range (IQR)17.24138641

Descriptive statistics

Standard deviation15.93539619
Coefficient of variation (CV)-0.1819972247
Kurtosis30.43753624
Mean-87.55845642
Median Absolute Deviation (MAD)8.427337646
Skewness0.8654874563
Sum-134752.4688
Variance253.9368439
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-78.637695312
 
0.1%
-72.249946591
 
0.1%
-82.835113531
 
0.1%
-117.58380891
 
0.1%
-72.583976751
 
0.1%
-75.18290711
 
0.1%
-73.670234681
 
0.1%
-73.83425141
 
0.1%
-92.084388731
 
0.1%
-80.240493771
 
0.1%
Other values (1528)1528
99.3%
ValueCountFrequency (%)
-157.926591
0.1%
-157.8883821
0.1%
-157.85964971
0.1%
-157.8189851
0.1%
-157.80784611
0.1%
ValueCountFrequency (%)
144.83581541
0.1%
-64.972862241
0.1%
-66.050010681
0.1%
-66.059616091
0.1%
-66.16206361
0.1%

latitude
Real number (ℝ≥0)

UNIQUE

Distinct1539
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean38.76544189
Minimum13.46504593
Maximum64.8575592
Zeros0
Zeros (%)0.0%
Memory size6.1 KiB

Quantile statistics

Minimum13.46504593
5-th percentile30.08745575
Q135.57273674
median40.00878143
Q341.95363808
95-th percentile44.978936
Maximum64.8575592
Range51.39251328
Interquartile range (IQR)6.380901337

Descriptive statistics

Standard deviation4.91193819
Coefficient of variation (CV)0.1267092079
Kurtosis2.61777854
Mean38.76544189
Median Absolute Deviation (MAD)2.633472443
Skewness-0.6972802877
Sum59660.01562
Variance24.12713623
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
42.249988561
 
0.1%
40.357986451
 
0.1%
42.667522431
 
0.1%
29.646289831
 
0.1%
40.292648321
 
0.1%
33.417720791
 
0.1%
47.666530611
 
0.1%
42.131587981
 
0.1%
40.448432921
 
0.1%
35.293235781
 
0.1%
Other values (1529)1529
99.4%
ValueCountFrequency (%)
13.465045931
0.1%
18.001649861
0.1%
18.083530431
0.1%
18.118820191
0.1%
18.20532991
0.1%
ValueCountFrequency (%)
64.85755921
0.1%
61.190967561
0.1%
61.190162661
0.1%
58.384845731
0.1%
48.737812041
0.1%

roomcap
Real number (ℝ≥0)

Distinct1211
Distinct (%)78.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1923.445094
Minimum1
Maximum18313
Zeros0
Zeros (%)0.0%
Memory size12.1 KiB

Quantile statistics

Minimum1
5-th percentile125
Q1556
median1156
Q32317.5
95-th percentile6597.3
Maximum18313
Range18312
Interquartile range (IQR)1761.5

Descriptive statistics

Standard deviation2339.018305
Coefficient of variation (CV)1.216056706
Kurtosis10.88861143
Mean1923.445094
Median Absolute Deviation (MAD)742
Skewness2.896746226
Sum2960182
Variance5471006.629
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10010
 
0.6%
1207
 
0.5%
6107
 
0.5%
4007
 
0.5%
2006
 
0.4%
5006
 
0.4%
12005
 
0.3%
6505
 
0.3%
14005
 
0.3%
1505
 
0.3%
Other values (1201)1476
95.9%
ValueCountFrequency (%)
11
0.1%
21
0.1%
41
0.1%
51
0.1%
101
0.1%
ValueCountFrequency (%)
183131
0.1%
180001
0.1%
170601
0.1%
159171
0.1%
151981
0.1%

applfeeu
Real number (ℝ≥0)

ZEROS

Distinct33
Distinct (%)2.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean31.85120208
Minimum0
Maximum200
Zeros445
Zeros (%)28.9%
Memory size12.1 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median30
Q350
95-th percentile75
Maximum200
Range200
Interquartile range (IQR)50

Descriptive statistics

Standard deviation25.6316171
Coefficient of variation (CV)0.8047299766
Kurtosis0.9531588794
Mean31.85120208
Median Absolute Deviation (MAD)20
Skewness0.4996573756
Sum49019
Variance656.9797954
MonotocityNot monotonic
Histogram with fixed size bins (bins=33)
ValueCountFrequency (%)
0445
28.9%
50234
15.2%
25159
 
10.3%
40111
 
7.2%
30107
 
7.0%
35103
 
6.7%
6554
 
3.5%
6054
 
3.5%
2050
 
3.2%
7546
 
3.0%
Other values (23)176
 
11.4%
ValueCountFrequency (%)
0445
28.9%
103
 
0.2%
156
 
0.4%
2050
 
3.2%
25159
 
10.3%
ValueCountFrequency (%)
2001
 
0.1%
1502
0.1%
1251
 
0.1%
1202
0.1%
1103
0.2%

applcn
Real number (ℝ≥0)

Distinct1407
Distinct (%)91.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6773.100065
Minimum3
Maximum102225
Zeros0
Zeros (%)0.0%
Memory size12.1 KiB

Quantile statistics

Minimum3
5-th percentile75.6
Q11259.5
median3217
Q37377.5
95-th percentile27680.5
Maximum102225
Range102222
Interquartile range (IQR)6118

Descriptive statistics

Standard deviation10407.8655
Coefficient of variation (CV)1.536647238
Kurtosis18.38907459
Mean6773.100065
Median Absolute Deviation (MAD)2418
Skewness3.650017185
Sum10423801
Variance108323664.4
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
54
 
0.3%
16934
 
0.3%
63
 
0.2%
303
 
0.2%
22313
 
0.2%
8733
 
0.2%
18433
 
0.2%
7463
 
0.2%
3883
 
0.2%
143
 
0.2%
Other values (1397)1507
97.9%
ValueCountFrequency (%)
33
0.2%
54
0.3%
63
0.2%
72
0.1%
82
0.1%
ValueCountFrequency (%)
1022251
0.1%
884461
0.1%
850921
0.1%
850441
0.1%
818241
0.1%

admssn
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1358
Distinct (%)88.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3792.214425
Minimum2
Maximum31878
Zeros0
Zeros (%)0.0%
Memory size12.1 KiB

Quantile statistics

Minimum2
5-th percentile48.9
Q1826.5
median2043
Q34600
95-th percentile14878.3
Maximum31878
Range31876
Interquartile range (IQR)3773.5

Descriptive statistics

Standard deviation4924.345735
Coefficient of variation (CV)1.298540953
Kurtosis7.273668739
Mean3792.214425
Median Absolute Deviation (MAD)1513
Skewness2.501910698
Sum5836218
Variance24249180.92
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
64
 
0.3%
34
 
0.3%
184
 
0.3%
74
 
0.3%
154
 
0.3%
8073
 
0.2%
25713
 
0.2%
2113
 
0.2%
8633
 
0.2%
13753
 
0.2%
Other values (1348)1504
97.7%
ValueCountFrequency (%)
21
 
0.1%
34
0.3%
41
 
0.1%
52
0.1%
64
0.3%
ValueCountFrequency (%)
318781
0.1%
310631
0.1%
307621
0.1%
300611
0.1%
298121
0.1%

enrlft
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1017
Distinct (%)66.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean980.5717999
Minimum0
Maximum10099
Zeros1
Zeros (%)0.1%
Memory size12.1 KiB

Quantile statistics

Minimum0
5-th percentile28
Q1218.5
median466
Q31123
95-th percentile4040.9
Maximum10099
Range10099
Interquartile range (IQR)904.5

Descriptive statistics

Standard deviation1340.696867
Coefficient of variation (CV)1.367260273
Kurtosis8.049878228
Mean980.5717999
Median Absolute Deviation (MAD)313
Skewness2.652212805
Sum1509100
Variance1797468.089
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
97
 
0.5%
2766
 
0.4%
2046
 
0.4%
2816
 
0.4%
36
 
0.4%
1715
 
0.3%
3415
 
0.3%
1645
 
0.3%
1735
 
0.3%
25
 
0.3%
Other values (1007)1483
96.4%
ValueCountFrequency (%)
01
 
0.1%
25
0.3%
36
0.4%
43
0.2%
51
 
0.1%
ValueCountFrequency (%)
100991
0.1%
82381
0.1%
81401
0.1%
79751
0.1%
78321
0.1%

enrlt
Real number (ℝ≥0)

HIGH CORRELATION

Distinct1030
Distinct (%)66.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1006.02729
Minimum2
Maximum11639
Zeros0
Zeros (%)0.0%
Memory size12.1 KiB

Quantile statistics

Minimum2
5-th percentile30
Q1220.5
median471
Q31146
95-th percentile4153.3
Maximum11639
Range11637
Interquartile range (IQR)925.5

Descriptive statistics

Standard deviation1385.874552
Coefficient of variation (CV)1.377571528
Kurtosis8.554906495
Mean1006.02729
Median Absolute Deviation (MAD)317
Skewness2.692328601
Sum1548276
Variance1920648.274
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1716
 
0.4%
96
 
0.4%
2036
 
0.4%
2796
 
0.4%
5505
 
0.3%
1835
 
0.3%
2045
 
0.3%
1975
 
0.3%
2675
 
0.3%
3385
 
0.3%
Other values (1020)1485
96.5%
ValueCountFrequency (%)
23
0.2%
35
0.3%
44
0.3%
52
 
0.1%
62
 
0.1%
ValueCountFrequency (%)
116391
0.1%
83811
0.1%
83661
0.1%
80011
0.1%
78741
0.1%

alloncam
Categorical

HIGH CORRELATION

Distinct2
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size1.8 KiB
2. No
1470 
1. Yes
 
69

Length

Max length6
Median length5
Mean length5.044834308
Min length5

Characters and Unicode

Total characters7764
Distinct characters9
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2. No
2nd row2. No
3rd row2. No
4th row2. No
5th row2. No
ValueCountFrequency (%)
2. No1470
95.5%
1. Yes69
 
4.5%
Histogram of lengths of the category
ValueCountFrequency (%)
no1470
47.8%
21470
47.8%
169
 
2.2%
yes69
 
2.2%

Most occurring characters

ValueCountFrequency (%)
.1539
19.8%
1539
19.8%
21470
18.9%
N1470
18.9%
o1470
18.9%
169
 
0.9%
Y69
 
0.9%
e69
 
0.9%
s69
 
0.9%

Most occurring categories

ValueCountFrequency (%)
Lowercase Letter1608
20.7%
Decimal Number1539
19.8%
Other Punctuation1539
19.8%
Space Separator1539
19.8%
Uppercase Letter1539
19.8%

Most frequent character per category

ValueCountFrequency (%)
o1470
91.4%
e69
 
4.3%
s69
 
4.3%
ValueCountFrequency (%)
21470
95.5%
169
 
4.5%
ValueCountFrequency (%)
N1470
95.5%
Y69
 
4.5%
ValueCountFrequency (%)
.1539
100.0%
ValueCountFrequency (%)
1539
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common4617
59.5%
Latin3147
40.5%

Most frequent character per script

ValueCountFrequency (%)
N1470
46.7%
o1470
46.7%
Y69
 
2.2%
e69
 
2.2%
s69
 
2.2%
ValueCountFrequency (%)
.1539
33.3%
1539
33.3%
21470
31.8%
169
 
1.5%

Most occurring blocks

ValueCountFrequency (%)
ASCII7764
100.0%

Most frequent character per block

ValueCountFrequency (%)
.1539
19.8%
1539
19.8%
21470
18.9%
N1470
18.9%
o1470
18.9%
169
 
0.9%
Y69
 
0.9%
e69
 
0.9%
s69
 
0.9%

accept
Real number (ℝ≥0)

Distinct1494
Distinct (%)97.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.6670723287
Minimum0.01852300829
Maximum1
Zeros0
Zeros (%)0.0%
Memory size12.1 KiB

Quantile statistics

Minimum0.01852300829
5-th percentile0.2725023853
Q10.540513361
median0.6910588235
Q30.8191577816
95-th percentile0.9636980579
Maximum1
Range0.9814769917
Interquartile range (IQR)0.2786444206

Descriptive statistics

Standard deviation0.2070943895
Coefficient of variation (CV)0.310452676
Kurtosis0.1457240781
Mean0.6670723287
Median Absolute Deviation (MAD)0.1370029363
Skewness-0.6635352811
Sum1026.624314
Variance0.04288808616
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
128
 
1.8%
0.83333333333
 
0.2%
0.85714285713
 
0.2%
0.83
 
0.2%
0.94444444442
 
0.1%
0.8752
 
0.1%
0.92307692312
 
0.1%
0.74358974362
 
0.1%
0.93333333332
 
0.1%
0.9752
 
0.1%
Other values (1484)1490
96.8%
ValueCountFrequency (%)
0.018523008291
0.1%
0.033039647581
0.1%
0.047307875571
0.1%
0.051561788081
0.1%
0.059208136581
0.1%
ValueCountFrequency (%)
128
1.8%
0.99629629631
 
0.1%
0.99602272731
 
0.1%
0.99587584111
 
0.1%
0.99516908211
 
0.1%

yield
Real number (ℝ≥0)

Distinct1496
Distinct (%)97.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3287117759
Minimum0.0429477794
Maximum1
Zeros0
Zeros (%)0.0%
Memory size12.1 KiB

Quantile statistics

Minimum0.0429477794
5-th percentile0.1213353139
Q10.1992459731
median0.2788671024
Q30.3956059551
95-th percentile0.7846383742
Maximum1
Range0.9570522206
Interquartile range (IQR)0.196359982

Descriptive statistics

Standard deviation0.1884903004
Coefficient of variation (CV)0.5734211982
Kurtosis2.09307436
Mean0.3287117759
Median Absolute Deviation (MAD)0.09187153397
Skewness1.49479586
Sum505.887423
Variance0.03552859334
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
113
 
0.8%
0.85714285715
 
0.3%
0.754
 
0.3%
0.83333333334
 
0.3%
0.66666666674
 
0.3%
0.83
 
0.2%
0.53
 
0.2%
0.8753
 
0.2%
0.57142857143
 
0.2%
0.41666666673
 
0.2%
Other values (1486)1494
97.1%
ValueCountFrequency (%)
0.04294777941
0.1%
0.051987767581
0.1%
0.052310374891
0.1%
0.066125760651
0.1%
0.075723359211
0.1%
ValueCountFrequency (%)
113
0.8%
0.99009900991
 
0.1%
0.97580645161
 
0.1%
0.97039473681
 
0.1%
0.95575221241
 
0.1%

roomamtX
Real number (ℝ≥0)

Distinct1093
Distinct (%)71.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean6072.890838
Minimum0
Maximum16675
Zeros5
Zeros (%)0.3%
Memory size12.1 KiB

Quantile statistics

Minimum0
5-th percentile2909
Q14696
median5880
Q37385
95-th percentile9616
Maximum16675
Range16675
Interquartile range (IQR)2689

Descriptive statistics

Standard deviation2128.264602
Coefficient of variation (CV)0.3504532945
Kurtosis1.428709871
Mean6072.890838
Median Absolute Deviation (MAD)1344
Skewness0.5691028581
Sum9346179
Variance4529510.214
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
540011
 
0.7%
480010
 
0.6%
500010
 
0.6%
45009
 
0.6%
42009
 
0.6%
66009
 
0.6%
52009
 
0.6%
60008
 
0.5%
24008
 
0.5%
61808
 
0.5%
Other values (1083)1448
94.1%
ValueCountFrequency (%)
05
0.3%
4601
 
0.1%
9501
 
0.1%
10001
 
0.1%
12001
 
0.1%
ValueCountFrequency (%)
166751
0.1%
161151
0.1%
153001
0.1%
150002
0.1%
145941
0.1%

boardamtX
Real number (ℝ≥0)

Distinct986
Distinct (%)64.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4404.346979
Minimum0
Maximum8965
Zeros6
Zeros (%)0.4%
Memory size12.1 KiB

Quantile statistics

Minimum0
5-th percentile2199.24
Q13620
median4396
Q35242.5
95-th percentile6446.84
Maximum8965
Range8965
Interquartile range (IQR)1622.5

Descriptive statistics

Standard deviation1301.414316
Coefficient of variation (CV)0.2954840574
Kurtosis0.5323868277
Mean4404.346979
Median Absolute Deviation (MAD)804
Skewness-0.2210720052
Sum6778290
Variance1693679.221
MonotocityNot monotonic
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
300013
 
0.8%
510013
 
0.8%
400013
 
0.8%
360011
 
0.7%
32009
 
0.6%
20008
 
0.5%
37008
 
0.5%
44008
 
0.5%
46008
 
0.5%
38008
 
0.5%
Other values (976)1440
93.6%
ValueCountFrequency (%)
06
0.4%
4801
 
0.1%
4901
 
0.1%
5561
 
0.1%
6001
 
0.1%
ValueCountFrequency (%)
89651
0.1%
84801
0.1%
83301
0.1%
82101
0.1%
80381
0.1%

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

A simple visualization of nullity by column.
Nullity matrix is a data-dense display which lets you quickly visually pick out patterns in data completion.

Sample

First rows

unitidinstnmstabbrsectoriclevelcontrollocaleinstcatc15basicinstsizelongitudlatituderoomcapapplfeeuapplcnadmssnenrlftenrltalloncamacceptyieldroomamtXboardamtX
0100654Alabama A & M UniversityALPublic, 4-year or aboveFour or more yearsPublicCity: MidsizeDegree-granting, primarily baccalaureate or aboveMaster^s Colleges & Universities: Larger Programs5,000 - 9,999-86.56850434.7833672614.030.08610.07772.01288.01294.02. No0.9026710.1664955400.03620.0
1100663University of Alabama at BirminghamALPublic, 4-year or aboveFour or more yearsPublicCity: MidsizeDegree-granting, primarily baccalaureate or aboveDoctoral Universities: Highest Research Activity20,000 and above-86.79934733.5056952785.030.07555.06936.02228.02299.02. No0.9180680.3314597532.04150.0
2100706University of Alabama in HuntsvilleALPublic, 4-year or aboveFour or more yearsPublicCity: MidsizeDegree-granting, primarily baccalaureate or aboveDoctoral Universities: Higher Research Activity5,000 - 9,999-86.64045034.7245561652.030.04454.03618.01341.01352.02. No0.8123040.3736875848.83899.2
3100724Alabama State UniversityALPublic, 4-year or aboveFour or more yearsPublicCity: MidsizeDegree-granting, primarily baccalaureate or aboveMaster^s Colleges & Universities: Medium Programs1,000 - 4,999-86.29567732.3643192491.025.06842.06696.0951.0967.02. No0.9786610.1444153346.02076.0
4100751The University of AlabamaALPublic, 4-year or aboveFour or more yearsPublicCity: SmallDegree-granting, primarily baccalaureate or aboveDoctoral Universities: Higher Research Activity20,000 and above-87.54597533.2118768449.040.038129.020321.07385.07407.02. No0.5329540.3645005750.03674.0
5100830Auburn University at MontgomeryALPublic, 4-year or aboveFour or more yearsPublicCity: MidsizeDegree-granting, primarily baccalaureate or aboveMaster^s Colleges & Universities: Larger Programs1,000 - 4,999-86.17754432.3673591200.00.02454.02022.0627.0652.02. No0.8239610.3224534580.02400.0
6100858Auburn UniversityALPublic, 4-year or aboveFour or more yearsPublicCity: SmallDegree-granting, primarily baccalaureate or aboveDoctoral Universities: Higher Research Activity20,000 and above-85.48825832.5993774737.050.018072.015168.04771.04836.02. No0.8393090.3188297860.05472.0
7100937Birmingham Southern CollegeALPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitCity: MidsizeDegree-granting, primarily baccalaureate or aboveBaccalaureate Colleges: Arts & Sciences Focus1,000 - 4,999-86.85055533.5137751597.050.02559.01583.0349.0349.02. No0.6186010.2204677410.04940.0
8101189Faulkner UniversityALPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitCity: MidsizeDegree-granting, primarily baccalaureate or aboveMaster^s Colleges & Universities: Small Programs1,000 - 4,999-86.21640832.384182704.025.02335.01191.0311.0333.02. No0.5100640.2795973500.03900.0
9101435Huntingdon CollegeALPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitCity: MidsizeDegree-granting, primarily baccalaureate or aboveBaccalaureate Colleges: Diverse Fields1,000 - 4,999-86.28436332.351032629.00.02074.01161.0294.0294.02. No0.5597880.2532305700.03800.0

Last rows

unitidinstnmstabbrsectoriclevelcontrollocaleinstcatc15basicinstsizelongitudlatituderoomcapapplfeeuapplcnadmssnenrlftenrltalloncamacceptyieldroomamtXboardamtX
1529488314Beth Medrash of Asbury ParkNJPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitSuburb: LargeDegree-granting, primarily baccalaureate or aboveNot applicable, not in Carnegie universe (not accredited or nondegree-granting)Under 1,000-74.21081540.04370560.00.030.026.023.023.02. No0.8666670.8846151770.01180.0
1530488350Yeshiva Gedolah Shaarei ShmuelNJPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitCity: SmallDegree-granting, primarily baccalaureate or aboveNot applicable, not in Carnegie universe (not accredited or nondegree-granting)Under 1,000-74.19759440.09100090.00.0100.036.027.027.01. Yes0.3600000.7500001386.0924.0
1531488785University of Saint KatherineCAPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitSuburb: LargeDegree-granting, primarily baccalaureate or aboveNot applicable, not in Carnegie universe (not accredited or nondegree-granting)Under 1,000-117.19654833.15128355.020.076.028.016.016.02. No0.3684210.5714299000.03600.0
1532488819The Colburn Conservatory of MusicCAPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitCity: LargeDegree-granting, primarily baccalaureate or aboveNot applicable, not in Carnegie universe (not accredited or nondegree-granting)Under 1,000-118.24984034.054070150.0120.0287.018.018.018.02. No0.0627181.00000012360.05879.0
1533489937Piedmont International UniversityNCPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitCity: MidsizeDegree-granting, primarily baccalaureate or aboveSpecial Focus Four-Year: Faith-Related InstitutionsUnder 1,000-80.25015336.087963185.039.0256.093.064.065.02. No0.3632810.6989254310.42873.6
1534490319Yeshiva Bais AharonNJPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitCity: SmallDegree-granting, primarily baccalaureate or aboveNot applicable, not in Carnegie universe (not accredited or nondegree-granting)Under 1,000-74.20309440.10100250.00.023.018.013.013.02. No0.7826090.7222221400.0700.0
1535490504Yeshiva Ohr NaftoliNYPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitSuburb: LargeDegree-granting, not primarily baccalaureate or aboveNot applicable, not in Carnegie universe (not accredited or nondegree-granting)Under 1,000-74.04628841.456985100.0100.015.015.011.011.01. Yes1.0000000.7333332820.01880.0
1536490513Bais Medrash Mayan HatorahNJPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitCity: SmallDegree-granting, primarily baccalaureate or aboveNot applicable, not in Carnegie universe (not accredited or nondegree-granting)Under 1,000-74.20433040.10741469.00.030.025.021.021.02. No0.8333330.8400001680.01120.0
1537490805Purdue University NorthwestINPublic, 4-year or aboveFour or more yearsPublicSuburb: LargeDegree-granting, primarily baccalaureate or aboveMaster^s Colleges & Universities: Larger Programs10,000 - 19,999-87.47423641.584324744.025.04136.01434.01070.01125.02. No0.3467120.7845195595.02238.0
1538491057Yeshiva Kollel Tifereth ElizerNYPrivate not-for-profit, 4-year or aboveFour or more yearsPrivate not-for-profitCity: LargeDegree-granting, primarily baccalaureate or aboveNot applicable, not in Carnegie universe (not accredited or nondegree-granting)Under 1,000-73.99217240.63709628.00.015.014.012.012.02. No0.9333330.8571432000.03000.0